MiNgMatch—A Fast N-gram Model for Word Segmentation of the Ainu Language
Similar resources
Chinese Word Segmentation with Maximum Entropy and N-gram Language Model
This paper presents the Chinese word segmentation systems developed by Speech and Hearing Research Group of National Laboratory on Machine Perception (NLMP) at Peking University, which were evaluated in the third International Chinese Word Segmentation Bakeoff held by SIGHAN. The Chinese character-based maximum entropy model, which switches the word segmentation task to a classification task, i...
Part-of-speech n-gram and word n-gram fused language model
In this paper, an accurate and compact language model is proposed to cope robustly with data sparseness and task dependencies. This language model adopts new categories which are generated by continuously interpolating POS word-class categories and word categories using MAP estimation. The new categories can reflect word statistics efficiently without losing accuracy and task-independent ge...
Fast Neural Network Language Model Lookups at N-Gram Speeds
Feed-forward Neural Network Language Models (NNLM) have shown consistent gains over backoff word n-gram models in a variety of tasks. However, backoff n-gram models still remain dominant in applications with real-time decoding requirements, as word probabilities can be computed orders of magnitude faster than with the NNLM. In this paper, we present a combination of techniques that allows us to speed...
The use of an appropriate MADM model for ranking the vendors of MCI equipment using a fuzzy approach
Abstract: Nowadays, the science of decision making has received more attention due to the complexity of supplier selection problems. One of the efficient tools in economic and human resources development is the extension of communication networks in developing countries, so the proper selection of suppliers of TC equipment is of great concern. In this study, a ...
Minimum Perfect H Fast N-gram Language
A new technique is proposed for N-gram language model (LM) retrieval based on minimum perfect hashing (MPH). A hierarchical data structure is used to store N-gram scores in hash tables according to the order of N-grams, and an LM score is retrieved by probing the appropriate hash table slot without collision. Both integer-key and character-string-key based MPH functions are studied. The proposed...
Journal
Journal title: Information
Year: 2019
ISSN: 2078-2489
DOI: 10.3390/info10100317